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!. OVERVIEW 



The College-Level Academic Skills Test (CLAST) is part of Florida's system of educational accountability 
and is mandated by Section 229.55 l(3)(k), FS. The CLAST is an achievement test measuring students' attainment 
of college-level communication and mathematics skills identified by faculties of community colleges and state univer- 
sities through the College-Level Academic Skills Project (CLASP). The skills (Appendix A) have been adopted 
by the State Board of Education (SBE) through Rule 6A-10.0316, FAC. Provisions for keeping the skills list 
current, m a in ta in i n g active participation of faculty members in the implementation of the testing program, and 
administering the test are provided in the CLAST Test Administration Plan. 

The CLAST consists of four subtests: Essay, English Language Skills (measuring objective writing skills), 
Reading, and Mathematics. Each subtest yields a single score reported to the student and to the institution needing 
the scores. Students also receive broad skill information useful in identifying areas of possible strength or weakness. 
While the CLAST does not yield the skill-by-skill information necessary for full diagnosis of individual student 
needs, institutions can identify areas of need for groups of students by aggregating scores into broad skills over 
several administrations. Although CLAST scores relate positively to other measures of academic performance, they 
do not predict examinees' future performance in upper division programs. 

Since August 1, 1984, students in public institutions in Florida have been required to have CLAST scores 
that satisfy the standards set forth in SBE Rule 6A-10.0312, FAC, for the award of an associate in arts degree and 
for admission to upper division status in a state university in Florida. In addition, students in private institutions 
may need CLAST scores to receive state financial aid. 

Statutes and rules r^rtaining to the CLAST requirement are contained in the CLAST Test Administration 

Plan. 



m Eligibility to Take the CLAST 

The CLAST may be taken by any student who seeks an associate in arts or a baccalaureate degree, has at 
least eighteen credit hours, and applies to take the test by the deadline established for registration. Students who 
have previously taken the CLAST and have not passed all subtests may apply to retake the failed subtest(s) during 
subsequent regular administrations. 

In addition, participating colleges and universities are to register other students who meet either of the 
following criteria: 

1. The students are eligible to participate in a State of Florida financial aid program governed by SBE 
Rule 6A-20.005, FAC. 

2. The students are required under provisions of SBE Rule 6A-20.005, FAC, to have CLAST scores 
to continue their eligibility beyond the academic term in which they register for the CLAST. 

Although CLAST scores are not needed to leceive an associate in science degree, students who are in that 
program may be registered for the CLAST if they satisfy the requirements for (1) the associate in arts degree or (2) 
admission to upper division status. 

In all cases, registration of students for the CLAST must be made in an institution that can determine the 
eligibility of applicants to take the test. Thus, registration normally will be done by the institution in which students 
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are enrolled during the term in which they will take the test. However, an applicant for upper division status at 
a state university who needs CLAST scores and meets other eligibility requirements but is not enrolled in an 
institution that administers the CLAST may be registered for the test in the institution that needs the scores. 

Students must apply to take the test on or before the registration deadline established for that 
administration. Students may not retake any subtest for which they already have a passing score. Students may 
not retake any subtest prior to thirty days from the previous administration of the subtest. 

M Test Administration Plan 

Under provisions of Section 229.551(3)(k), Florida Statutes, the Commissioner of Education maintains 
statewide responsibility for the administration of the CLAST. 

A plan for the administration of the CLAST for the 1992-93 academic year was issued by the Commissioner 
in August 1992. The plan, developed by the Department of Education, assigns administrative responsibility for the 
CLAST at three levels: the Department of Education; the Statewide Test Administrator (a technical support 
contractor); and the community colleges and state universities which administer the test to eligible students. The 
Office of Instructional Resources of the University of Florida is the Statewide Test Administrator. 

The plan also describes the policies and procedures under which the testing program operates. The CLAST 
Test Administration Manual and the CLAST Institutional Test Administrator's Manual, which are made a part of 
the plan, give additional specific information to assist institutional personnel in carrying out their responsibilities. 



II. DEVELOPMENT OF THE CLAST 



The test development process for the CLAST began with identifying skills to be assessed and continues with 
developing items for inclusion in the test. This chapter describes the major developmental efforts culminating in 
the first test administration, the item development procedures, and the development of standards (passing scores). 

MH Background 

In 1979 the Florida Legislature, through Florida Statute 79-222 (now Section 229.551), enacted legislation 
requiring the identification of skills to measure the achievement of essential academic skills of college ^udents. 
The Department of Education then charged the Articulation Coordinating Committee with the task of implementing 
that part of the legislation dealing with the identification of skills and tests to measure achievement of those skills. 
The result was the establishment of the Essential Academic Skills Project (EASP, now CLASP). The EASP included 
an executive committee, a project director, a state-level task force on communication, a state-level task force on 
computation, and a state-level standing committee on student achievement. Members of these initial groups are 
identified in the CLAST Technical Report, 1982-83; current members are identified in Appendix B. 

Ml Identification of Skills 

The state-level task forces, together with the project director and other project personnel acting in an 
advisory capacity, worked to identify essential academic skills that every student, regardless of major, should have 
acquired by the end of the sophomore year. The task forces worked through a series of meetings from January to 
November of 1980 with input from institutional-level task forces that had been established to involve faculty 
members in Florida's public universities and community colleges in the identification of the skills. 

The task forces identified four generic competencies (reading, listening, writing, and speaking) in 
communication and four generic competencies (algorithms, concepts, generalizations, and problem solving) in 
computation. Each generic competency was subsequently reviewed and broad skill categories were developed for 
each competency. 

Skills were then developed for each broad skill category. These skills were presented to a random sample 
of faculty members from broad discipline areas in Florida's public community colleges and universities. Based on 
the results of the survey, the t^k forces made recommendations to the SBE. In September 1981 the SBE adopted 
all of the skills recommended by the task forces. During 1985 and 1989, an extensive review of the CLASP skills 
resulted in the addi tion, deletion, and/or modification of some of the original skills. As a result of the 1985 review, 
revised skills were adopted by the SBE and have been measured by the CLAST since the fall 1987 administration 
(see Table 1); the revised skills resulting from the 1989 review were incorporated into the CLAST's fall 1992 
administration. 
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TABLE 1 


Communication and Computation Competencies and Broad Skills 




COMPXTTATTON 


READING 


ALGORITHMS 


Literal Comprehension 


Arithmetic 


Critical Comprehension 


Geometry and Measurement 




Algebra 


LISTENING 


Statistics, including Probability 


Literal Comprehension 


Logical Reasoning 


Critical Comprehension 






CONCEPTS 




Arithmetic 


Multiple-Choice 


Geometry and Measurement 


Word Choice 


Algebra 


Sentence Structure 


Statistics, mc^din^ Probability 


Grammar, Spelling, 


Logical Reasoning 


Capitalization, and Punctuation 






GENERALIZATION 


essay 


Arithmetic 


Suitability to Purpose and Audience 


Geometry and Measurement 


Effectiveness and Conformity to Standard 


Algebra 


English 


Statistics, including Probability 




Logical Reasoning 


SPEAKING 




Composition of Message 


PROBLEM SOLVING 


Transmission of Message 


Arithmetic 




Geometry and Measurement 




Algebra 




Statistics, ncluding Probability 




Logical Reasoning 



M Review of Available Tests 

Once the skills had been identified, the Standing Committee on Student Achievement, with the assistance 
of project staff, began its task of identifying tests and other assessment procedures that could be used to measure 
achievement of the skills. To accomplish the task, an extensive search was conducted to review commercially 
available tests and tests developed by community colleges and state universities which might be appropriate for 
measuring achievement of communication and computation skills. Sixty-six communication tests and fifty-four 
computation tests were reviewed in depth. Though all of the tests addressed some of the skills, none was judged 
adequate for measuring all of the skills identified in SBE Rule 6A-10.0310, FAC. (Current skills are identified in 
Rule 6A-10.0316, FAC.) 

It was recommended that three multiple-choice subtests be developed in the areas of writing, reading, and 
computation. Since all of the writing skills could not be tested using a multiple-choice format, it was further 
recommended that an essay test be developed to measure the entire set of writing skills. Although it was determined 
that the identified listening and speaking skills should be acquired by students upon completion of their sophomore 
year, no statewide tests were developed to measure student achievement of those skills. 
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tt Development of Test Specifications 



Specifications for a test that could be used to measure the achievement of the skills listed in SBE Rule 
6A-10.0310 (now 6A-10.0316), FAC, were developed between April and August of 1981 by the project director 
and staff, with assistance from the Standing Committee on Student Achievement, the communication and 
computation task forces, and measurement consultants. Recommendations of state-level task force members about 
the assessment of the skills, as well as practical and measurement issues, were considered in determining the nature 
of the subtests and the number of items to be included in each subtest. These same procedures were followed for 
revising the test specifications necessitated by the 1985 and 1989 skill revisions. Specifications for the 1992-93 
forms are described in Chapter m. 

M Development of Item Specifications 

After test specifications were developed, formulation of item specifications began. During the fall of 198 1 , 
item specifications were written for the reading and writing skills as well as for the computation skills dealing with 
algorithms and concepts. In 1983, item specifications for computation skills dealing with generalizations and 
problem solving were written and reviewed. Concurrently, the original specifications for the essay, writing, and 
reading items were reviewed again and revised as necessary. This process was repeated following the 1985 and 
1989 skill revisions. 

All specifications were written by the chairpersons of the state-level task forces with assistance from task 
force members, standing committee members, content and measurement consultants, and Department of Education 
staff. Reviews of the specifications were conducted by faculty members from community colleges and state 
universities. 

Item writers used the item specifications as guides for item content and format. Copies of item 
specifications were distributee for use in all thirty-seven community colleges and state universities to aid faculties 
in planning for instruction and assessment of the skills. Copies of item specifications are available in the institutions 
as well as from the Department of Education. 

M Development of Items 

Items are developed for the CLAST through contracts with postsecondary faculty who write, review, pilot- 
test, and revise items based on item specifications and recommendations of state-level item review committees. 
Items developed under these contracts are submitted to the Department of Education for field-testing and analysis. 
The following procedures are used to develop and approve test items for the CLAST. 

1. A contractor is selected based on its qualifications, including its past performance as an item 
developer and the qualifications of its item writers and reviewers. 

2. The contractor holds a training session for item writers and reviewers to discuss test security issues, 
the purpose of the CLAST, the use of item specifications, characteristics of good test items, item bias 
issues, and specific assignments to the contractor. 

3. Initial drafts of items are written and reviewed by members of the contractor's item writing team. 
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4. Items are pilot-tested with college students, and the results of the pilot test and suggestions from other 
item writers are used in revising the items. The pilot test involves administering each item to about 
thirty students and interviewing at least five of them to obtain specific information about the items. 

5. Based on pilot-test data, items are reviewed and revised by members of the contractor's review team 
who have not been involved in the item writing. Atteciion is given to content, measurement, and 
bias issues (Appendix C). 

6. Revised items are submitted to the Department of Education, and a state-level committee is convened 
to review the items and recommend revisions and/or deletions in the contractor's set. 

7. Based on state-level review, items are revised by the contractor's team and submitted to the 
Department of Education in final form. 

8. Items are then included in the CLAST as developmental items and are not counted as scored items 
for students. This produces classical and Rasch item statistics for evaluating item quality. Items are 
screened based on whether they deviate from the following criteria: p-value greater than or equal to 
.40, point-biserial greater than or equal to .30, Rasch fit-between less than or equal to 3.0, and Rasch 
total fit less than or equal to 1.0 + 3 standard errors. These criteria represent an ideal level of func- 
tioning for an item. If the item point-biserid statistic is less than 0.30, the item may still be 
considered for use on a future examination if it measures an important dimension of a required objec- 
tive. Items are not used if the point-biserial correlation coefficients are close to or less than zero. 

9. Essay topics are field-tested by a qualified contractor. Data generated for topic evaluations include 
distribution of scores, number of essays written, number written off topic, mean score, median score, 
percentage of complete agreement between raters, percentage of agreement within one score point, 
alpha coefficients with and without referee, and reader comments. Topics are evaluated in terms of 
clarity, relevance and appeal to the target population, and suitability for development of an essay; 
topics are also screened for potential biasing elements. The contractor recommends the topics suitable 
for inclusion in the CLAST and identifies any potential problems. 

New items for the CLAST Reading, English Language Skills (ELS), and Mathematics subtests were developed in 
March 1992 through a contract with Florida State University. New topics for the Essay subtest were developed in 
1991. ^ 



In the fall 1992 administration, changes in the ELS test format resulting from the 1989 skills review and 1990 
revision of item specifications were implemented. (Most ELS skills are now tested with items tied to specific 
passages.) The October 1992 Reading and Mathematics subtests also tested new skills resulting from the 1989 skills 
review, although for these subtests there was comparatively little change in format. 

a Development of CLAST Standards 



CLAST standards (passing scores) were set by the SBE in March 1984. The passing scores reflected the 
judgment of a state-level panel of interested persons concerning the minimum level of performance acceptable for 
the successful completion of the sophomore year in community colleges and state universities in Florida. SBE Rule 
6A-10.0312(1), FAC, establishes minimum standards, in terms of scaled scores, for each CLAST subtest for 
specified periods of time (Table 2). Students are required to meet the set of standards in effect when they first take 
the CLAST. 
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TABLE 2 








Standards (Passing Scores) for CLAST Subtests 






'*>*V... ; S6^£DSCQRES " 


••^••TTMBPHRIOD 




engush 








; :> ''; V: - v: " : '"'es^ : ;:' :: 




READING 


MATHEMATICS 


8/1/84- 7/31/86 


4 


265 


260 


260 


8/1/86- 7/31/89 


4 


270 


270 


275 


8/1/89- 9/30/91 


4 


295 


295 


285 


10/1/91-9/30/92 


5* 


295 


295 


290 


10/1/92 and thereafter 


6 


295 


295 


295 



* Established with a revision of tha scoring scale; equivalent to a total score of A on the prior scale. 



These tiers of standards are viewed by state-level panel members as reasonable expectations for all students, 
given the instructional program available to students taking the CLAST during each time period. The CLAST 
Technical Report, 1983-84 provides a foil description of the process through which the standards were developed. 
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III. DESCRIPTION OF THE CLAST 



Each form of the CLAST is developed according to specific guidelines which ensure that test forms from 
one administration to another are parallel in content and that administration procedures are standardized. This 
chapter describes the guidelines. 

B Test Specifications 

For each of the three annual administrations (fall, spring, and summer), a different test is created; however, 
each test measures the same number of items in each broad skill area (Table 3). To increase test security, two forms 
of each test are printed for each administration. Both forms contain the same scored items, but the order of item 
placement is different in each form. Developmental items are embedded in each test form in order to provide data 
needed to add items to the item bank. 

The CLAST comprises four subtests. The Essay subtest is presented in a four-page folder; the English 
Language Skills and Reading subtests are in the same test book, and the Mathematics subtest is in a separate test 
book. 



Mltem Bank 

As items are developed, they are numbered with a nine-digk code identifying the subtest, skill, sequence 
number, and graphic. These items are stored in a card file and a word processing file that are updated as items are 
revised. New items are added to the bank following the review of the developmental items from each 
administration. 

A history and attribute computer file is kept for the item bank and is used in the selection of items for test 
forms and in the test analysis process. The file includes attributes such as the item code, broad skill code, item flag, 
'ate used, and test form. Statistical data include the percentage correct, item point-biserial coefficient, Rasch 
difficulty, fit statistics, and index of discrimination for each item. Data on items are kept in the active file for six 
administrations. After that time, a hard copy and a tape record are stored. The computer bank then is rotated to 
remove the data from the earliest administrations. 

M Test Assembly 

For each administration, items are drawn from the item bank to meet the test specifications. Items are 
selected to minimize the difference in difficulty between forms. Current item difficulty values are used in the 
selection process. Test form item difficulties are centered near zero logits. Small variations in mean difficulty 
occur, particularly in the Reading and ELS subtests, where items are tied to specific passages. Alternate forms are 
adjusted to the common scale by the equating procedures described in Chapter IV. 
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CLAST Spc 


TABLE 3 
rifications by Subtest, 1992-53 




NtlMBEROF 
SKILLS* 


NUMBER OF ITEMS 




'^sboRED::^;-;;- 


I)EVETX)PMENTAL 




ESSAY 

(Holistically scored; not tcsicdwith 
objective items.) 










ENGLISH LANGUAGE SKILLS 
Word Choice 
Sentence Structure 
Grammar, Spelling, 
Capitalization, and Punctuation 
Total 


2 
4 

10 
16 


4 
12 

19 
35 


"T 


40 


READING 

Literal Comprehension 
v^iiucdi v^riiprcncnsion 

Total 


3 
10 
13 


9 
27 
36 


T 


7\ 


MATHEMATICS 

Arithmetic 
Algebra 

Geometry and Measurement 
Logical Reasoning 
Statistics, including Probability 
Total 


13 
17 
10 

8 
9 
57 


13 
16 
7 
7 
7 
50 


T 


55 



* Totals may include subskills and alternate skill formats. 



The plan for format and arrangement of items in test forms is intended to make each form attractive and 
easy to read. Multip" -choice writing items are grouped by format and content to make the test time-efficient for 
students. 



Test Instructions 



General instructions provided to students contain information about scoring, recording answers, the number 
of items, and the time allotted for each subtest. Directions state that scores are based on the number of right 
answers with no correction for guessing. 

The CLAST is administered in one session, which requires nearly five hours. Although actual testing time 
is four hours, additional time is required to check in examinees, code identifying information, distribute and collect 
materials, read directions for each subtest, and provide a ten-minute restroom break. The Essay test is administered 
first, and students are allowed 60 minutes to complete it; the English Language Skills and Reading tests are given 
aext, and 80 minutes are allowed for their completion; the Mathematics test is administered last, and students are 
given 90 minutes to work on it. 

Modifications in test format, such as braille, audio cassette, and large-print materials, are available for 
handicapped students. In addition, the test schedule and admimstration procedures are modified for handicapped 
examinees. Details of these modifications are provided in the CLAST Institutional Test Administrator's Manual. 
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m Quality Control 



Test form quality is maintained through an extensive review process. Drafts of new test forms are reviewed 
by staff of the technical support contractor and the Department of Education. After changes in items and corrections 
are made, there is a thorough review of camera-ready copy, which is followed by a careful review of bluelines. 
Additional information about the performance of the test is taken from f he institutional test administrators' and room 
supervisors' reports and on-site visits to test centers by Department of Education personnel. These reports provide 
information about the quality of test booklets, the standardization of test administrations, and the adequacy of 
allotted test times. 



IV. TECHNICAL CHARACTERISTICS OF THE CLAST 



To preserve comparability of CLAST scores from one administration to the next, test scores are equated 
using a base scale. To ensure reliability and validity of the test and test items, many traditional test analysis 
procedures are used. This section describes the equating process and procedures used to review the reliability and 
validity of the test. 

M Test Score Equating 
The Rasch Model 

The CLAST scale development is based on the logistic response model of Georg Rasch, presented in 
Probabilistic Models for Some Intelligence and Attainment Tests, 1960. Rasch describes a probabilistic model in 
which the probability that a person will answer an item correctly is assumed to be based on the ability of a person 
and the difficulty of the item. These estimates are derived independently and are not related to the particular sample 
of people or of items. When the assumptions of the model are met, tests of unequal difficulty can be equated. 

Rasch model estimates of person ability and item difficulty are obtained using the unconditional maximum 
likelihood estimation procedure described in Wright, Mead, and Bell, BICAL: Calibrating Items With the Rasch 
Model, 1980. The probability of a score X u is expressed as 



exp [X w (B v - 6)} 
1 + exp [3 v - 



where X u = 



a score, B v » person ability, and 6 . = item difficulty. 



Pe, 3on ability in logits represents the natural log odds for succeeding on items which define the scale 
origin. The item difficulty in logits represents the natural log odds for failure on an item by persons with abilities 
at the scale origin. 

One key assumption of the Rasch model is that a test under consideration is unidimensional. That is, it 
measures only one underlying student cognitive ability. Unfortunately, ability is considered to be "latent" and 
cannot be seen or measured in a very precise manner. Therefore, it is important to monitor the performance of the 
test and to conduct studies that will indicate whether the test is likely to be umdimensional. This has been done 
with the CLAST examination in two studies. The first study was performed in 1984 with the Mathematics subtest. 
The second was done in 1986 with the Reading, Mathematics, and ELS subtests. Both studies showed that the use 
of Rasch techniques is justified. 

Calibratio n of Items 

Item dil^culties are obtained by calibrating the scored items for each administration. Three ystematic 
random samples of 700 records are drawn. The items are calibrated, and the item difficulty logits are averaged from 
the three calibration samples. The averaged difficulties are used to adjust the item logits to the October 1982 base 
scale. 
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Item history records are kept in a computer file and updated after each administration. The stability of 
Rasch difficulty, discrimination values, and fit statistics are checked, and items that change values by more than 
.3 logit sure flagged for further inspection. In addition, following each administration, items are re-examined against 
established item screening criteria. 

Newly developed or revised items are embedded within each form of the test and then calibrated and 
adjusted to the base scale. These items are not counted toward examinees' scores end are not included in the initial 
calibrations used to develop the score scale. After the score scale is cre&ied, each test form is recalibrated with both 
the new and the scored items to estimate item difficulties of the new items. The scored items serve as a link 
between the new items in each test form. Item difficulties for the new items are adjusted to the base scale using 
the linking constant derived from the comparison of the calibration of the scored items to their base item difficulties. 
For a complete discussion of the method, see Ryan, J., Equating New Test Forms to an Existing Test, 1981. 

Generation of Ability Estimates 

The traditional estimate of achievement level is the raw score obtained from the number of correct answers 
provided. The Rasch model is used to generate ability estimates corresponding to the traditional test score. 

Adjusted item difficulty logits obtained in item calibration become the basis for estimating person abilities. 
Generation of ability estimates results iu a logit ability scale corresponding to the logit difficulty scale of items. 
Rasch ability logits are derived using the unconditional maximum likelihood estimation procedures of the program 
ABIL-EST (Ryan, 1981). 

The ability estimate corresponding to each raw score between one point and the number of items minus one 
is calculated. (Perfect or zero scores are not included in Rasch calculations.) The ability logit scale is then centered 
at the mean for the October 1982 administration and converted to the standard score scale using a linear 
transformation. 

T 

Linking Sca l ed Scores 

Through the use of Rasch methodology, it is possible to place scores from tests of unequal difficulty on 
the same scale. While the CLAST difficulty is controlled by selecting items having approximately the same average 
and range of difficulty for each administration, some fluctuation in difficulty may occur in order to use items 
representing a broad range of content and difficulty. Differences in test form difficulty are controlled by equating. 

Tests forms given on two different occasions are equated by using information obtained from a subset of 
items common to both forms. These common items are known as "anchor items. " The performance of the two 
groups of examinees on the anchor items is used to adjust the measurement scales for the two forms; the 
measurement scale for the second form is "adjusted" to that of the first form. From a measurement perspective, 
the examinees in both instances took the same form of the test. For the CLAST, all test forms are equated back 
to the first administration of October 1982. With this approach, all students face identical hurdles in that no student 
has the advantage of an "easier" 1 form. 

For each administration, CLAST item difficulties have been adjusted to the base scale of October 1982. 
Item logifc obtained from calibrating the scored items are adjusted by adding the linking constant to each item logit. 
The difference in average difficulty represents the shift in overall difficulty between test forms. This constant is 
added to the current item logits to adjust them to the base scale. The stability of the link is evaluated by comparing 
the difficulty values over time to the values in the base scale. 
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1 Reliability of Scorns 



Reliability is an indicator of the consistency in measurement of student achievement. It provides an 
estimate of the variation in results that can be attributed to random error in measurement. The index of reliability 
is interpreted as the ratio of true-score variance to observed-score variance. Reliability is estimated somewhat 
differently for multiple-choice scores and essay ratings. Procedures used with each type of score are described in 
the following sections. 

Reliability of Multiple-Choice Scores 

The reliability of multiple-choice subtest scores is estimated using the Kuder-Richardson Formula 20 (KR- 
20) coefficient and the standard error of measurement (SEM). The KR-20 coefficient is an internal consistency esti- 
mate of reliability, proposed by Kuder and Richardson in 1937, based on the concept that achievement on items 
drawn from the same content domain should be related. The formula reported as the KR-20 is 



k 

Jfc-1 



where estimated test reliability, k - number of test items, s* « variance of examinees' total scores, 

and E pq = sum of item variances. 

The KR-20 coefficient is appropriate for estimating reliability of scores on multiple-choice tests. However, 
the KR-20 coefficient can be affected by the distribution of scores. For this reason, the SEM is also reported as 
an indicator of reliability for each m iltiple-choice subtest. 

The SEM represents the expected standard deviation of scores for an individual taking a large number of 
randomly selected parallel tests. The mean of the set of scores would represent the individual's true score. 
Therefore, the SEM can be used to estimate confidence intervals axound an individual's true score. Confidence 
intervals applied to obtained scores are not symmetrical about the obtained score, but the estimated true score is 
useful in obtaining the center for a confidence zone to be used with the obtained score. The smaller the SEM, the 
less dispersed are the parallel test scores and the more likely the estimate is close to the individual's true score. 



The formula for computing the SEM is SEM = s t ^l-r tt where s t = standard deviation of the test 
scores and r u - test reliability coefficient. 

The KR-20s and SEMs for the CLAST multiple-choice subtests indicate that they are acceptably reliable 
(Table 4). 



TABLE 4 





English Language Skills 


Reading 


Mathematics 


OCTOBER 


FEBRUARY 


JtBIE 


OCTOBER 


FEBRUARY 


JUKE 


OCTOBER 


FEBRUARY 


JUNE 


KR-20 


.74 


.71 


.77 


.77 


.74 


.75 


.86 


.85 


.84 


SEM 


1.91 


1.93 


2.03 


2.31 


2.20 


2.29 


2.76 


2.75 


2.77 
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Reliability of Essay Ratings / 

Reliability of essay ratings is evaluated in several ways to ensure that raters have adhered to established 
criteria for scoring essays. Consistency in scoring is maintained by training the raters and monitoring the scoring 
process; the reliability of the combined ratings is estimated by coefficient alpha. Both procedures are described 
below. 

Training prior to and during scoring is used to develop and maintain consistency in scoring by the 
individual rater and the group of raters. The scoring process is monitored by checking the assignment of ratings, 
the number of split ratings, and the distribution of ratings of each reader. All papers assigned non-contiguous 
ratings axe submitted to a referee who resolves the split scores. During and after each reading session, reader 
agreement data reflecting the reliability of ratings are reviewed. For the 1992-93 test administrations, the percentage 
of complete agreement between readers for all papers ranged from 42.4 to 43.6, while the percentage of non- 
contiguous scores ranged from 10.3 to 10.5 (Table 5). The complete agreement, by topic, resulting from the 
assignment to a referee of papers with non-contiguous scores was between 46 and 48% (Table 6). 



TABLE 5 

Summary Data for All Essay Readers, 1992-93 




? October 




June 




; PEBCKHX - 


. NUMBER/ 




NUMBER 


; PERCENT 


Total Papers Read 


15,606 


100.0 


18,224 


100.0 


10,202 


100.0 


Non-Contiguous Scores 


1,631 


10.5 


1,898 


10.4 


1,054 


10.3 


Total Agreement Between Readers 


6,799 


43.6 


7,741 


42.5 


4,330 


42.4 



TABLE 6 

Essay Reader Agreement after Referee, 1992-93 




% Complete Agreement: 


% Agreement within One Point 


OCTOBER 


FEBRUARY 


JURE 


OCTOBER 


FEBRUARY 


JUNE 


TOPIC 1 


47 


47 


46 


53 


53 


54 


TOPIC 2 


48 


47 


47 


52 


53 


53 



Reliability of combined ratings for essays is estimated by coefficient alpha, which gives the expected 
correlation between combined ratings of the scoring team and those of a hypothetical parallel team doing the same 
task. The formula is 



k 



1 - 



where r tt = coefficient of reliability, k = number of test items, Lsf = sum of item variances, and s* 
variance of examinees' total scores. 
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Alpha coefficients by topic for the ratings from 1992-93 show they are consistent across topics and ad- 
ministrations (Table 7). 



TABLE 7 
Alpha Coefficients, 1992-93 




Non-Refereed Scores 


Refereed Scores 


OCTOBER 


FKBRDARX 


JOHE 


OCTOBER 


FEBRUARY 


JUNE 


TOPIC 1 


.72 


.71 


.69 


.87 


.86 


.85 


TOPIC 2 


.75 


.71 


.73 


.85 


.85 


.85 



Reliability of Pass/Fail Classification 



Since CLAST scores are used to determine whether students in Florida's community colleges and 
universities have achieved the level of performance required for the award of an associate in arts degree or for 
admission to upper division status, reliability in testing and retesting is an important issue. The reliability issue 
of interest is whether students would consistently pass or would consistently fail if several parallel forms of the test 
were administered to them. The results of a test-retest study conducted in 1984 indicate that the CLAST is reliable 
for making pass/fail decisions based on the 1984-86 standards. A complete report of the study is available from 
the Department of Education, and a summary is available in Appendix D. 



M Item Analysis 

An item analysis such as the one shown in Figure 1 is prepared for the total group of examinees, each 
gender, and each racial/ethnic category. These analyses include the number and percentage of examinees who chose 
each item response, who omitted the item, or who gridded more than one response. In addition, they include item 
difficulty (proportion of examinees choosing the orrect response), item discrimination, and point biserial correla- 
tion. 

Following test administration, preliminary item analyses - > run on the first answer sheets received for 
scoring. Results of these analyses are screened for item flaws or key errors. Clues to such errors are low dis- 
crimination indices or Rasch fit statistics with high values. Other indicators of problems include lack of balance 
in foil distributions or inordinate difficulty. Items exhibiting these characteristics are flagged and, following a 
Department of Education review, may be excluded from scoring. 



Pretesting new items embedded in the test forms is another form of quality control. Before an item is 
added to the bank, it is pretested as a non-scored item, and its item statistics are reviewed. Items not meeting the 
item selection criteria are examined to determine if they are adequate measures of the skills. Any item deemed 
inappropriate is flagged and not used on the CLAST. 
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MATH SCORED TOTAL 



TZH ANALYSIS 



P A 



1 — ITEM RESPONSE FIGURES ARE TOTALS, NOT PERCENTAGES . 



ITEM 
NUMBER 


A 


- 1 T 
B 


E M 

c 


REtl 
D 


1 

2 

3 

a 

5 


801 
1200 
208 
746 
273 


2756 
1*0 1 
275 

9416 + 
98 


2167 
800U + 
316 
230 
8667 + 


14702 + 

826 
9656 + 

61 
1392 


6 
7 
8 
9 
10 


386 
801 
750 
1035 
5519* 


8930+ 
658 
80 
915 

1641 


626 
8731 + 

2U7 
7568 + 
1800 


1481 
262 

9378 + 
921 

1475 


1 1 
12 
13 
14 
15 


3914 
398 
661 
390 

7170* 


5689+ 
100U 
6630 + 
309 
599 


728 
8455 + 
2177 
9628+ 

281 


1 12 
588 

956 

1143 

2398 


16 
17 
18 
19 
20 


1037 
322 

9814 14* 
177 
273 


8353 + 
1 141 
261 
6563 + 
9466 + 


631 
268U 

186 
1931 

618 


14 141 
6292+ 

169 
1788 
90 


21 

22 
23 
24 
25 


557 
;>,498+ 
t867 + 
1?09 

\U2 


1432 
613 
814 
629 

7081 + 


7927 + 
694 
572 
4<)5 
641 


532 
614 
193 
8164 + 
21P^ 



482 
428 
8525 + 
6567 + 
1 108 

5557 + 
1611 

652 
9900 + 

261 

1436 
4504 
4596 + 
85? 
93 

6 127 + 
1462 
133 
1499 
6998 + 

61 
514 
338 
724 
254 



4053 
180 
1 170 
1847 
751 

497 
7788 + 

1399 
335 
1403 

8496+ 
4066 + 
1089 
1433 
197 

2450 
773 
9825 + 

35 
1007 

459 
6528 + 
805 
582 
483 



66 
1303 
496 
1 191 
6034 + 

2654 

361 
7289 + 

134 
2004 

347 
770 
711 
5515 + 
1 76 

414 
8092 + 
294 
214 
693 

9940 + 
1504 
9175 + 
973 
334 



5863 + 
8549 + 
24 7 
828 
2509 

1754 
689 
10/3 
92 
6762 + 

169 
309 
4036 
260 3 
990 2 * 

1470 
113 
205 
8719+ 
1666 

8 

1856 
105 
8119 + 
9113 + 





OK 1 T 


HULT 


0 


!*? 


0 


0 


44 


0 


0 


18 


2 


0 


22 


0 


0 


42 


3 


0 


52 


0 


0 


22 


1 


0 


12 


0 


0 


34 


2 


0 


40 


0 


0 


31 


1 


0 


30 


0 


0 


49 


2 


0 


5 


0 


0 


27 


0 


0 


13 


0 


0 


35 


1 


0 


15 


0 


0 


13 


3 


0 


28 


0 


0 


27 


0 


0 


55 


1 


0 


27 


2 


0 


8 


0 


0 


23 


0 


L 


1 1 


0 


0 


13 


2 


0 


37 


0 


0 


42 


0 


0 


73 


0 


0 


13 


0 


0 


26 


0 


0 


32 


0 


0 


14 


0 


0 


23 


2 


0 


27 


0 


0 


26 


0 


0 


43 


0 


0 


52 


{) 


0 


27 


0 


0 


14 


0 


0 


34 


1 


0 


17 


1 


0 


7 


1 


0 


11 1 


0 


0 


7 


0 


0 


72 


1 


0 


52 


0 


0 


77 


0 


0 


291 


0 



ITEM 
DIFFICULTY 

0.45 
0.76 
0.92 
0.90 
0.83 

0.85 
0.83 
0 90 
0.72 
0.53 

0.54 
0.81 
0.63 
0.92 
0.68 

0.80 
0.60 
0.94 
0.63 
0.90 

0.76 
0.81 
0.85 
0.78 
0.68 

0.56 
0.02 
0.81 
0.63 
0.58 

0.53 
0.74 
0.70 
0.95 
0.65 

0.81 
0.46 
0.44 
0.53 
0.95 

0.58 
0.77 
0.94 
0.83 
0.67 

0.95 
0.62 
0.88 
0.78 
0.87 



ITEM 
DISCRIMINATION 



0.46 
0.38 
0.13 
0.29 
0.37 

0.11 
0.H6 
0.23 
0.37 
0.29 



POINT BISERIAL 
CORRELATION 

0.48 
0.37 
0.33 
0.32 
0.24 

0.38 
0.31 
0,34 
0,41 
0.45 

0.34 
0.35 
0.30 
0.33 
0.34 

0 .45 
0.34 
0,37 
0.22 
0.32 

0.33 
0.48 
0.40 
0.38 
0.2 7 

0.22 
0.42 
0.42 
0.23 
0.30 

0.40 
0.39 
0.36 
0.24 
0.17 

0.27 
0 .2 t 
0.27 
0.29 
0.28 

0. 36 
0.37 
0.24 
0.33 
0.33 

0.25 
0. 30 
0. 30 
0.37 
0.38 



INDICATES CORRECT ANSWER 
INDICATES EVERYONE GIVEN CREDIT 
INDICATES QUESTION THROWN OUT 



Figure L Example of an item analysis. 
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M Preventing Item Bias 



In addition to examining item analyses, review panels established at each stage of test development 
considered the issue of bias in the items. Scatter graphs were examined after each administration to determine if 
particular items operated differently for various racial or ethnic groups. 

A scatter graph (Figure 2) contrasts performance on individual items by racinl/ethnic or gender categories. 
An item difficulty is identified as an outlier i: it deviates substantially from the general relationship for Uie compared 
groups. Consistent differences yi item difficulties may indicate only a difference in the level of achievement among 
the compared groups, but items that deviate from this general pattern are further examined for content bias that may 
be related to gender or racial/ethnic background. 
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Figure 2 » Example of a scatter graph of item difficulties comparing the performance of males with that of females. 



M Validity of Scores 

Strictly speaking, one should not describe a test as being "valid. " Instead, one should describe a test score 
as being "valid" for a particular purpose. Hence, test development operations are designed to build evidence for 
a particular type of score interpretation which is defined in advance. 

Standards for Educational and Psychological Testing (1985) describes three types of validity: content, 
construct, and criterion. Content validity is the only important type for the CLAST because test scores are only 
interpreted in terms of what they indicate about student achievement of designated performance objectives. The 
CLAST does not measure a designated psychological characteristic (e.g., spatial visualization), so construct validity 
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is not relevant. Further, as has been stated, the CLAST was not designed to predict a student's future performance 
in school. Hence, the criterion-related (i.e., predictive) validity is not relevant. Content validity is substantiated 
by determining the extent to which the test items adequately measure the specific skills they are designed to 
measure; that is, the extent to which the content of the test matches the set of skills. The validity of the test is 
established by following the plan and procedures for developing and selecting items for each fonn of the CLAST. 

The general plan used in developing the test is outlined below. 

1. General test specifications, consistent with the purpose of the CLAST, are developed by faculty who 
have expertise in both testing and the content areas (English language skills, reading, and math- 
ematics) with assistance from Department of Education staff. 

2. Item specifications, detailing both the content and the format of items that can be developed to 
measure each of the skills, are developed by faculty with expertise in both the content areas and 
testing, with assistance from Department of Education staff. 

3. Test items are written by faculty according to the guidelines provided by the item specifications and 
are reviewed by faculty and Department of Education staff with careful attention given to content, 
measurement, and bias issues. 

4. Test items are field-tested in community colleges and state universities. 

5. Items are analyzed statistically and selected for use in the test only if they meet criteria established 
by Department of Education staff and testing consultants. 

6. A test plan for selection of items is followed in developing alternate forms of the test. 

7. Scaled scores equated to the reference scale are generated using the Rasch model. 

To summarize, validity of the test as a measure of achievement of the skills is established by following the 
plan for developing and selecting items. Content and testing specialists judge the adequacy of the items for 
measuring the skills, and the plan for selecting items ensures that each form of the CLAST is representative of the 
domain of skills being tested. Scores on each of the subtests, then, can be interpreted to be valid indicators of 
students' achievement of the communication and mathematics skills measured by the CLAST. 
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V. SCORING AND REPORTING PROCEDURES 



Procedures for scoring the CLAST are designed to provide quality control and score scale stability for a 
testing program that has complex scoring and reporting requirements. The process for scoring and reporting reflects 
concern for reliability and comparability of the scores and for appropriate use of the scores. This chapter addresses 
those concerns. 



M Scoring Activities 
Editing Answer Sheets 

Following each administration, as answer sheets are received from each institution, they are edited for 
errors. Answer sheets are read by an NCS Opscan 21 scanner programmed to identity mismarked or miscoded 
sheets. Each identified answer sheet is hand-checked and corrected according to the scoring conventions. 

Rating sheets from holistic scoring of essays are also machine-scored. Editing procedures for holistic 
scoring include a verification of the legitimacy of reader numbers and score codes. Papers with invalid scores or 
with ratings that differ by more than one point are returned to the referee to be corrected and/or reviewed. 

Scoring Conventions 

Within the parameter of number-right scoring, certain conventions are observed: for a response to be 
considered valid, it must be recorded in the answer folder; for a score to be generated on a subtest, at least one 
response must be marked in the appropriate section of the answer sheet; and omits and double grids are counted as 
incorrect. To receive credit for the essay test, students must write on one of the two topics provided, and they must 
write the essay in their answer folders. 

Students' subtest scores below the chance hvel are compared to their other subtest scores. If a score is 
inconsistent with the student's performance on the other subtests, it is hand-checked to determine if the student 
entered the correct form code on the answer sheet. 



U Score Scales 

A three-digit standard scaled score is generated for each administration for each of the multiple-choice 
subtests. Ihe Ssandax 1 score scale is a linear transformation of the Rasch ability logits adjusted for the mean of the 
October 1982 administration. The formula used is 



where S i ■ scaled score, X b * ability logit, and C * October 1982 scale adjustment factor (1.87 for English 
language skills, 1.2 for reading, and 1.0 for mathematics). Raw score to scaled score transformation data are gen- 
erated for each subtest for each administration (Tables 8, 9, and 10). 



S { = 30(X b -O + 300 
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TABLE 8 

ENGLISH LANGUAGE SKILLS SCORE CONVERSIONS, 1992-93 





OCTOBER 


FEBRUARY 




JUNE , : : 








Ability 


* Scaled Sior^^ 


$ Ability: 


; Score 


0 


-6.801 


095 


-6.515 


104 


-6.063 


118 


1 


-5.818 


125 


-5.572 


132 


-5.122 


146 


2 


-5.070 


147 


-4.851 


154 


-4.402 


167 


3 


-4.600 


162 


-4.407 


167 


-3.960 


181 


4 


-4.243 


172 


-4.075 


177 


-3.631 


191 


5 


-3.948 


181 


-3.805 


185 


-3.364 


199 


6 


-3.691 


189 


-3.571 


192 


-3.136 


205 


7 


-3.460 


196 


-3.363 


199 


-2.934 


211 


8 


-3.249 


202 


-3.173 


204 


-2.750 


217 


9 


-3.051 


208 


-2.996 


210 


-2.581 


222 


10 


-2.865 


214 


-2.828 


215 


-2.422 


227 


11 


-2.687 


219 


-2.668 


219 


-2.271 


231 


12 


-2.515 


224 


-2.513 


224 


-2.127 


236 


13 


-2.349 


229 


-2.361 


229 


-1.987 


240 


14 


-2.187 


234 


-2.212 


233 


-1.852 


244 


15 


-2.028 


239 


-2.065 


238 


-1.719 


248 


16 


-1.870 


243 


-1.918 


242 


-1.588 


252 


17 


-1.713 


248 


-1.771 


246 


-1.458 


256 


18 


-1.556 


253 


-1.623 


251 


-1.329 


260 


19 


-1.399 


258 


-1.474 


255 


-1.199 


264 


20 


- 1 . 240 


969 


-1 "399 


9^n 


-1.068 


267 


21 


-1.078 


267 


-1.166 


265 


-0.935 


271 


22 


-0.913 


272 


-1.007 


269 


-U.oUU 


Z/O 


23 


-0.743 


277 


-0.842 


274 


-0.660 


280 


24 


-0.568 


282 


-0.671 


279 


-0.516 


284 


25 


-0.386 


288 


-0.492 


285 


-0.366 


289 


26 


-0.195 


294 


-0.303 


290 


-0.207 


293 


27 


0.007 


300 


-0.101 


296 


-0.038 


298 


28 


0.224 


306 


0.116 


303 


0.145 


304 


29 


0.459 


313 


0.354 


310 


0.347 


310 


30 


0.722 


321 


0.620 


318 


0.575 


317 


31 


1.023 


330 


0.926 


327 


0.841 


325 


32 


1.387 


341 


1.297 


338 


1.169 


335 


33 


1.864 


355 


1.783 


353 


1.610 


348 


34 


2.619 


378 


2.550 


376 


2.330 


369 


35 


3.612 


408 


3.560 


406 


3.270 


398 
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READING SCORE CONVERSIONS, 1992-93 





OCTOBER 


FEBRUARY 


JUNE 


Raw Score 


Ability 


Scaled Score 


Ability 


Scaled Score 


Ability 


Scaled Score 


0 


-5.778 


126 


-5.896 


123 


-5.774 


126 


1 


-4.825 


155 


-4.953 


151 


-4.826 


155 


2 


-4.096 


177 


-4,231 


173 


--.101 


176 


3 


-3.647 


190 


-3.788 


186 


-3.655 


190 


4 


-3.313 


200 


-3.459 


196 


-3.323 


200 


5 


-3.040 


208 


-3.191 


204 


-3.053 


208 


6 


-2.807 


215 


-2.963 


211 


-2.822 


215 


/ 


-2.601 


221 


-2.761 


217 


-2.617 


221 


8 


-2.414 


227 


-2.577 


222 


-2.431 


227 


9 


-2.242 


232 


-2.408 


227 


-2.259 


232 


10 


-2.080 


237 


-2.249 


232 


-2.098 


237 


11 


-1.928 


242 


-2.098 


237 


-1.945 


241 


12 


-1.781 


246 


-1.954 


241 


-1.800 


246 


13 


-1.640 


250 


-1.814 


245 


-1.659 


250 


14 


-1.503 


254 


-1.678 


249 


-1.522 


254 


15 


-1.370 


258 


-1.545 


253 


-1.389 


258 


16 


-1.238 


262 


-1.413 


257 


-1.258 


262 


17 


-1.108 


266 


-1.282 


261 


-1.129 


266 


18 


-0.978 


270 


-1.152 


265 


-1.000 


270 


19 


-0.849 


274 


-1.021 


269 


-0.872 


273 


20 


-0.719 


278 


-0.889 


273 


-0.743 


277 


21 


-0.587 


282 


-0.755 


277 


-0.613 


281 


22 


-0.454 


286 


-0.619 


281 


-0.482 


285 


23 


-0.317 


290 


-0.479 


285 


-0.348 


289 


24 


-0.177 


294 


-0.335 


289 


-0.210 


293 


25 


-0.032 


299 


-0.186 


294 


-0.067 


297 


26 


0.120 


303 


-0.030 


299 


0.082 


302 


27 


0.280 


308 


0.135 


304 


0.238 


307 


28 


0.451 


313 


0.311 


309 


0.406 


312 


29 


0.636 


319 


0.501 


315 


0.587 


317 


30 


0.839 


325 


0.710 


321 


0.786 


323 


31 


1.069 


332 


0.945 


328 


1.012 


330 


32 


1.338 


340 


1.219 


336 


1.276 


338 


33 


1.668 


350 


1.554 


346 


1.601 


348 


34 


2.112 


363 


2.003 


360 


2.040 


361 


35 


2.834 


385 


2.730 


381 


2.757 


382 


36 


| 3.778 


413 


3.681 


410 


3.693 


410 
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TABLE 10 










MATHEMATICS SCORE CONVERSIONS, 1992-93 






OCTOBER 


FEBRUARY 


JUNE 


Raw Score 


Ability 


Scaled Score 


Ability 


: Scaled Score 


Ability 


Scaled Score 


o 


-6.042 


118 


-6.250 


112 


-6.030 


1 

119 


l 


-5.112 


146 


-5.295 


141 


-5.104 


146 


2 


-4.398 


168 


-4.564 


163 


-4393 


168 


3 


-3.966 


181 


4.116 


176 


-3.963 


181 


4 


-3.650 


190 


-3.786 


186 


-3.648 


190 


5 


-3.396 


198 


-3.520 


194 


-3.396 


198 


6 


-3.182 


204 


-3.295 


201 


-3.184 


204 


7 


-2.996 


210 


-3.098 


207 


-2.999 


210 


8 • 


-2.829 


215 


-2.921 


212 


-2.834 


214 


9 


-2.678 


219 


-2.760 


217 


-2.683 


219 


10 


-2.538 


223 


-2.611 


221 


-2545 


223 


11 


-2.408 


227 


-2.472 


225 


-2.415 


227 


12 


-2.285 


231 


-2.341 


229 


-2.293 


231 


X3 


-2.169 


234 


-2.216 


233 


-2.178 


234 


14 


-2.058 


238 


-2.096 


237 


-2.067 


237 


15 


-1.951 


241 


-1.981 


240 


-1.961 


241 


16 


-1.848 


244 


-1.870 


243 


-1.858 


244 


17 


-1.748 


247 


-1.762 


247 


-1.758 


247 


18 


-1.651 


250 


-1.657 


250 


-1.661 


250 


19 


-1-556 


253 


-1.553 


253 


-1.565 


253 


20 


-1.463 


256 


-1.452 


256 


-1.471 


255 


21 


-1.371 


258 


-1.352 


259 


-1.379 


258 


22 


-1.281 


261 


-1.254 


262 


-1.287 


261 


23 


-1.191 


264 


-1.156 


265 


-1.196 


264 


24 


-1.102 


266 


-1.059 


268 


-1.106 


266 


25 


-1.014 


269 


-0.962 


271 


-1.015 


269 


26 


-0.925 


272 


-0.865 


274 


-0.924 


272 


27 


n ok 


114 


-0.768 


276 


-0.833 


275 


28 


-U. /4 / 


a § 


-0.671 


279 


-0.741 


277 


29 


-U.CO / 


ZiSU 


-0.5 fl 


282 


-0.648 


280 


30 


-UJO / 


ZoZ 


-0.473 


285 


-0554 


283 


31 


_ll AHA 


Zoo 


-0.372 


288 


-0,458 


286 


32 


-U.JO l 


mOO 


-0.270 


291 


-0.359 


289 


33 


-U.ZoJ 


^yi 


-0.165 


295 


-0.259 


292 


34 






-0.058 


298 


•0,155 


295 


35 


-0.085 


297 


0.052 


301 


•0.048 


.98 


36 


0.019 


300 


0.166 


304 


0.063 


301 


37 


0.128 


303 


0.285 


308 


0.178 


305 


38 


0.242 


307 


■ 0.408 


312 


0.300 


309 


39 


0.362 


310 


0.539 


316 


0.428 


312 


40 


0.490 


314 


0.677 


320 


0.565 


316 


41 


0.627 


318 


0.825 


324 


0.711 


321 


42 


0.775 


323 


0.985 


329 


0.870 


326 


^3 


0.938 


328 


1.160 


334 


1.045 


331 


44 


1.121 


333 


1.356 


340 


1.240 


337 


45 


1.331 


339 


1.580 


347 


1.463 




46 


1.580 


347 


1.844 


355 


1.727 


351 


47 


1.893 


356 


2,172 


365 


2.055 


361 


48 


2.320 


369 


2.617 


378 


2500 


375 


49 


3,029 


390 


3.344 


400 


3.227 


396 


50 


3,951 


418 


4.293 


428 


4.176 


425 



- 24 - 

26 



The score scale ranges fro~» approximately 100 points to 400 points. It is centered at 300 points, 
designating the state average score on the October 1982 administration. All subsequent examinations are equated 
to this administration. Differences in scaled score ranges across test forms occur as a result of differences in the 
range of item difficulty in test forms. The difficulty of each form is controlled, however, so that these shifts in the 
average score range are small. If one test form has items that are more difficult, it is possible to obtain a higher 
scaled score because the harder items measure a higher level of achievement. 

The essay score is assigned on a scale of two to twelve points. Two readers rate each essay on a rating 
scale from one to six points. The essay score is the sum of the two ratings. The holistic scoring procedure and 
rating scale are discussed in the next section. 



M Essay Scoring 

Holistic scoring or evaluation, a process forjudging the quality of writing samples, has been used for many 
years by testing agencies in credit-by-examination, state assessment, and teacher certification programs. 

Holistic Scores 

Essays are scored holistically — that is, for the total, overall impression they make on the reader — rather 
than analytically, which requires careful analysis of specific features of a piece of writing. Holistic scoring assumes 
that the skills which make up the ability to write are closely interrelated and that one skill cannot be separated from 
the others. Thus, the writing is viewed as a total work in which the whole is something more than the sum of the 
parts. A reader reads a writing sample once, forms an impression of its overall quality, and assigns it a numerical 
rating based on his/her judgment of how well the paper meets a particular set of established criteria. A six-point 
scale reflecting the following performance levels is used to score CLAST essays. 

Score of 6. The paper presents or implies a thesis that is developed with noticeable coherence. The writer's ideas 
are usually substantive, sophisticated, and carefully elaborated. The writer's choice of language and 
structure is precise and purposeful, often to the point of being polished. Control of sentence 
structure, usage, and mechanics, despite an occasional flaw, contributes to the writer's ability to 
communicate the purpose. 

Score of 5. The paper presents or implies a thesis and provides convincing, specific support. The writer's ideas 
are usually fresh, mature, and extensively developed. The writer demonstrates a command of 
language and uses a variety of structures. Control of sentence structure, usage, and mechanics, 
despite an occasional flaw, contributes to the writer's ability to communicate the purpose. 

Score of 4. The paper presents a thesis and often suggests a plan of development, which is usually carried out. 

The writer provides enough supporting detail to accomplish the purpose of the paper. The writer 
makes competent use of language and sometimes varies sentence structure. Occasional errors in 
sentence structure, usage, and mechanics do not interfere with the writer's ability to communicate the 
purpose. 

?core <rf ?. The paper presents a thesis and often suggests a plan of development, which is usually carried out. 

The writer provides support that tends toward generalized statements or a listing. In general, the 
support is neither sufficient nor clear enough to be convincing. Sentence structure tends to be 
pedestrian and often repetitious. Errors in sentence structure, usage, and mechanics sometimes 
interfere with the writer's ability to communicate the purpose. 
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Score of 2. The paper usually presents a thesis. The writer provides support that tends to be sketchy and/or 
illogical. Sentence structure may be simplistic and disjointed. Errors in sentence structure, usage, 
and mechanics frequently interfere with the writer's ability to communicate the purpose. 

Score of 1. The paper generally presents a thesis that is vaguely worded or weakly asserted. Support, if any, 
tends to be rambling and/or superficial. The writer uses language that often becomes tangled, 
incoherent, and thus confusing. Errors in sentence structure, usage, and mechanics frequently occur. 

Holistic Scoring 

The holistic scoring session must be conducted in a highly organized manner with competent staff members 
who have clearly specified responsibilities. For ten thousand essays, the holistic scoring staff consists of a chief 
reader, three assistant chief readers, twenty table leaders, and one hundred readers. A support staff of a manager 
and five clerks is also required. 

The scoring procedure follows this pattern. Prior to the scoring session, the chief reader and assistants 
sample the total group of essays to choose from each of the two topics examples which clearly rep^sent the estab- 
lished standards for each of the six ratings on the rating scale. These essays are known as range finders. In 
addition, other essays are chosen as training materials during the scoring sessions. 

After range finders and samples are selected, table leaders meet with the chief and assistant chief readers 
to score the samples and determine if the samples clearly represent the six levels of the scale. The purpose of this 
session is to refine the sample selection and to ensure consensus among table leaders. Range finders from previous 
administrations are also reviewed and used in the training to ensure consistency in scoring from one administration 
to another. 

Immediately prior to and intermittently throughout the scoring session, the chief reader trains the readers 
using the range finders and other samples. Immediately after the initial training session, scoring begins. Each essay 
is read by two readers, each of whom assigns it a rating of one, two, three, four, five, or six. The sum of the 
ratings is the total score assigned to the essay. A total score of five or above is passing for examinees first taking 
the CLAST before October 1992. 

In situations where the two readers' individual ratings differ by more than one point, the essay is read by 
a third reader, the referee. The referee's rating will replace one of the existing two ratings for a revised total sccre. 

A more complete description of the process is in A Training Manual for the Holistic Scoring of the Essay 
Subtest of the College-Level Academic Skills Test, available in the Department of Education office. 

Recruitment of Readers 

Each institution that registers students for the CLAST may participate in the holistic scoring process. The 
chief reader solicits nominations for readers from the chairs of English departments in community colleges and 
universities. Nominations for readers are made on the basis of the candidate's interest in the process, willingness 
to set aside personal standards forjudging the quality of writing and to undergo training, and availability to work 
over weekends. Candidates must have a minimum of two years' experience teaching composition, hold at least a 
master's degree or equivalent, have a major in English in at least one degree, and teach composition as part of their 
assigned responsibilities. Nominations may include secondary school teachers who teach composition at the junior 
or senior year level in high schools and faculty who teach composition in private postsecondary institutions. 
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Upon receiving nominations from department chairs, .lie chief reader and the Statewide Test Administrator 
ask each nominee interested in becoming a reader to complete and submit an application form. The forms are used 
to determine whether applicants meet the criteria for readers, 

M Reporting Test Results 

The reports outlined below are generated for each adrninistration. In addition to these reports, institutions 
may request from the Statewide Test Adrninistrator a computer tape or diskettes containing their students' data, 
including item responses. Thus, institutions can generate their own reports and update files of students' records. 
A test blueprint giving item-skill correspondence and a data tape format are also provided to institutions. 

Student Reports 

The individual student report (Figure 3) and a score interpretation guide are mailed to students 
approximately five weeks after the examination date. A scaled score is reported for each subtest taken. In the boxes 
to the right of the scale score is reported the percentage of items correct in each brood skill area. Although the 
percentages are reported to the student, they do not become part of the student's transcript. The percentages help 
students determine their relative strengths and weaknesses in the broad skill areas represented on the test, but 
because of differences in the number of test items used in the broad skill areas, the percentages cannot be averaged 
to determine the overall percentage of correct responses. 

Preliminary Reports-prepared at the state and institutional levels 

1. Summary statistics (means, medians, and standard deviations) and frequency distributions of scores 
by 

a. Student classification: 

Community college A.A. program 
Community college A.S. program 
University native student 
University transfer student 

b. Racial/ethnic classification: 

White/non-Hispanic 
B lack/non-H ispan ic 
Hispanic 

American Indian/ Alaskan native 
Asian/Pacific Islander 
Non-Resident Alien 

c. Gender by xaciai/ethnic classification 

2. Alphabetic roster of examinees' scores 

Final Reports-prepared at the state and institutional levels 

1 . Means and percents of first-time examinees meeting current standards for 

a. students with 60 or more hours 

b. students with fewer than 60 hours 

(cont. p. 29) 
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Individual Score Report 
COLLEGE-LEVEL ACADEMIC SKILLS TEST 

DATE OF EXAM: 



S.S. * 



INSTITUTION 



Below are your scores on the College-level Academ.c Skills Test in the shaded box on the left is your essay score in the 
remaining shaded boxes are your three-digit scale scores for each subtest After each scale score is printed the following 
information for each broao sk-" area of a subtest number of correct answers number of questions - percentage of items 
answered correctly. The enclosed interpretation guide will help you understand your scores 

This report is only for your information The ofticinl record of your scores is kept by your institution on your tranr.cnpt 



ESSAY 



ENGLISH LANGUAGE SKILLS 



READING 



Word 
Cholca 


Sentence 
Structure 


Grammar. 
Spelling. 
Punctuntlon. 
Capitalization 











Comprehension 
Literal Critical 

1 









MATHEMATICS 



P^C^fei|- Arithmetic : Algebra i Geometry - 
^-.PvWlIi; ! I i Meeeurement 

m - i 


Logical 
Reesonlng 


Statistics 


Hi ! ! 







Paaelng ecoree on CLAST hava b«en aetebllehed by the Stete Boerd of Educetlon a« fcllowe: 

Eunii«;h L nnauno' 1 
tf,S; 'y Skins Handing Mniriomntic. 



Studente ere required to me«t the etenderde In effect et the time they flret took the test. 
It you have questions ebout your scores, you should contect: 



Figure 3. Copy of a blank student report form. 
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no 


BEST COPY AVAILABLE 








1 ■■. e|| 



(from p. 27) 



c. state university native students 

d. state university transfer students 

e. students by gender and racial ethnic category for each institution, all public institutions, all private 
institutions, all community colleges, and all state universities 

2. Means and percents of first-time examinees meeting future standards by gender and racial ethnic 
category for each institution, all public institutions, and all private institutions 

3. Means and percents of retake examinees meeting required standards by gender and racial ethnic 
category for each institution, all public institutions, and all private institutions 

Statistical Reports-prepared at the state level only 

1. Rasch item calibrations and fit statistics 

2. Scaled score derivations 

3. Classical item analysis by racial/ethnic classification 

4. Item difficulty plots by gender and racial/ethnic classification 

5. KR-20 coefficients and SEM's for multiple-choice subtests 

6. Interrater reliability for essay scores 

7. Coefficient alpha by gender and racial/ethnic classification for essay scores 

U Interpreting and Using Scores 

CLAST scores are reported to indicate students* achievement of those skills upon which the test is based. 
The CLAST scaled scores, not the raw scores, for each subtest are used for tlis purpose since the scaled scores have 
been adjusted for differences in difficulty in test forms. A scaled score of 300, for instance, represents the same 
achievement level across forms but may require a higher raw score on an easier form than on a harder one. The 
same scaled score, then, represents the same level of achievement of the skills regardless of the test form taken. 

The use of CLAST scores is prescribed by Florida Statutes and Rules of the SBE. Use of scores prior to 
August 1, 1984, was limited to student advising and curriculum improvement. Since August 1, 1984, students in 
public institutions in Florida are required to have CLAST scores which satisfy the standards set forth in Rule 6A- 
10.0312, FAC, for the award of an associate in arts degree and for the admission to upper division status in a state 
university in Florida. However, students who have satisfied CLAST standards on three of the four subtests and 
who are otherwise eligible may be enrolled in state universities for up to an additional thirty-six semester credits 
of upper division coursework before they are required to pass the fourth subtest. 

Standards (passing scores) for the CLAST have been adopted by the SBE in Rule 6A-10.0312(1), FAC. 
The standards for each designated period of time are indicated in Chapter II. 

The CLAST was not developed to predict success in upper division programs, but to assess the level of 
achievement of the skills listed in Appendix A. Any use of the scores for selection of students for specific upper 
division programs must be empirically validated. 
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VI. SUMMARY OF 1992-93 RESULTS 



The results of CLAST administrations indicate the level of achievement of communication and computation 
skills by students in community colleges and state universities. Summary data presented in this section describe stu- 
dent performance on the CLAST as a whole and on each subtest. Summary data are based on only those students 
who were first-time takers in public institutions. 

The mean, standard deviation, and median of raw scores and scaled scores are reported by subtest for each 
administration (Table 1 1). For the group of all examinees, mean scaled scores for the Essay subtest were the same 
for the October, February, and June administrations. For the Reading and Mathematics subtests, mean scaled scores 
for the total population of examinees were generally consistent across administrations. Slightly greater variation 
among mean scaled scores is shown for the English Language Skills subtest, with the highest mean score occurring 
for the October 1992 administration. 

The percentage of examinees that passed the CLAST was 55 in October 1992, 56 in February 1993, and 
56 in June 1993 (Table 12). The passing rates for groups of students classified on the basis of gender or 
racial/ethnic background varied across administrations, ranging from a low of 23 % to a high of 64% (Table 12). 

Mean scores are reported for all students, for students grouped according to gender, for students grouped 
according to racial/ethnic background, for students in community colleges, and for students in the state university 
system. These means are provided separately for the Essay, English Language Skills, Reading, and Mathematics 
subtests and are found in Tables 13, 14, 15, and 16, respectively. 
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TABLE 11 












Raw and Scaled Scores, 1992-93 










(First-Time Examinees in Public Institutions) 








■ ; . no. of.: -.■ 


RAW SCORE 


SCALED SCORE 


















ITEMS 


MEAN 


SIX). DEV. 


MEDIAN 


MEAN 


SID. DEV. 


MEDIAN 


ESSAY 
















October 










7.6 


2.0 


8 


February 










7.6 


1.9 


8 


June 










7.6 


1 0 


o 
0 


Enzlish Language Skills 
















October 


35 


29.4 


3.8 


30 


3235 


33.2 


321 


February 


35 


29.1 


3.6 


30 


316.7 


29.1 


318 


June 


35 


29.2 


4.2 


30 


319.6 


31.0 


317 


Reading 
















October 


36 


27.4 


4.9 


28 


314.9 


27.9 


313 


February 


36 


28.6 


4.3 


29 


317.3 


27.3 


315 


June 


36 


28.0 


4.6 


29 


316.6 


26.9 


317 


Mathematics 
















October 


50 


38.0 


7.3 


39 


313.0 


30.3 


310 


February 


50 


36.9 


7.1 


38 


312.6 


30.2 


312 


June 


50 


37.1 


6.9 


38 


3^0.2 


28.1 


309 



TABLE 12 

Percentage of Examinees Passing All Four Subtests, 1992-93 
(First-Time Examinees in Public Institutions) 


EXAMINEE GROUP 


OCTOBER 


FEBRUARY 


JUNE 


Number 
Tested 


Percent 
Passing 


Number 
Tested 


Percent 
Passing 


Number 
Tested 


Percent 
Passing 


All 


15,646 


55 


18,309 


56 


10,219 


56 


Male 


6,654 


56 


8,000 


59 


4,281 


58 


Female 


8,992 


53 


10,309 


55 


5,938 


54 


White 


10,656 


64 


12,900 


64 


7,241 


64 


Black 


2,154 


30 


1,925 


28 


949 


23 


Hispanic 


1,818 


36 


2,322 


41 


1,403 


40 


Asian/Pacific Islander 


581 


45 


630 


48 


360 


42 


American Indian/Alaskan Native 


42 


57 


61 


46 


40 


60 


Non-Restdent Alien 


300 


37 


345 


40 


180 


32 


Unknown Race 


95 


52 


126 


42 


46 


54 


Community College 


8,230 


47 


11,178 


50 


7,488 


53 


State University System 


7,416 


63 


7,131 


66 


2,731 


62 



9 
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TABLE 13 

Essay Mean Scaled Scores, 1992-93 
(First-Time Examinees in Public Institutions) 




. OCTOBER ■ 


V FEBRUARY 






. ... Mean 


Number 






. ..Mean . 


All 


15,701 


7.6 


18,350 


7.6 


10,248 


7.6 


Male 


6,682 


IS 


8,014 


7.4 


4,294 


7.4 


Female 


9,019 


IS 


10^36 


7.8 


5,954 


7.8 


White 


10,693 


8.0 


12,932 


7.9 


7,256 


7.9 


Black 


2,15? 


6.8 


1,931 


6.8 


954 


6.7 


Hispanic 


1,827 


7.0 


2,323 


7.1 


1,408 


7.0 


Asian /Pacific Islander 


585 


6.9 


632 


6.8 


362 


6.6 


American Indian/Alaskan Native 


42 


7.7 


61 


8.0 


40 


8.1 


Non-Resident Alien 


302 


6.8 


345 


6.7 


181 


6.5 


Unknown Race 


95 


7.4 


126 


7.0 


47 


7.8 


Community College 


8,258 


7.4 


11,204 


7.4 


7,508 


7.5 


State University System 


7,443 


7.9 


7,146 7.9 


2,740 


7.9 



TABLE 14 

English Language Skills Mean Scaled Scores, 1992-93 
(First-Time Examinees in Public Institutions) 


EXAMINEE GROUP 


Oi^BER : 


FEBRUARY 


JUNE 


Number 


Mean 


Number 


Mean. 


Number 


Mean 


All 


15,708 


323 


18,361 


317 


10,246 


320 


Male 


6,683 


320 


8,025 


315 


4,295 


317 


Female 


9,025 


326 


10,336 


318 


5,951 


322 


White 


10,694 


331 


12,937 


321 


7,253 


325 


Black 


2,159 


306 


1,931 


301 


954 


299 


Hispanic 


1,832 


307 


2,326 


307 


1,408 


309 


Asian/Pacificlslander 


584 


315 


633 


310 


364 


310 


American Indian/Alaskan Native 


42 


329 


61 


311 


40 


321 


Non-Resident Alien 


302 


307 


346 


307 


180 


303 


Unknown Race 


95 


321 


127 


312 


47 


318 


Community College 


8,261 


318 


11,214 


313 


7,508 


317 


State University System 


7,447 


329 


7,147 


323 


2,738 


325 
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TABLE 15 

Reading Mean Scaled Scores, 1992-93 
(First-Time Examinees in Public Institutions) 




OCTOBER ; 


.: FEBRUARY^ ^ : 




: Number 




Number 


Mean 


Number 


. Mean 


All 


15,704 


315 


18^60 


317 


10344 


317 


Male 


6,682 


314 


8,023 


318 


4,292 


318 


Female 


9,022 


315 


10337 


316 


5,952 


315 


White 


10,694 


321 


12,938 


322 


7352 


322 


Black 


2,158 


299 


1,931 


301 


953 


298 


Hispanic 


1,830 


305 


2,325 


309 


1,408 


306 


Asian/Pacificlslander 


584 


305 


633 


308 


363 


303 


American Indian/Alaskan Native 


42 


323 


61 


318 


40 


321 


Non-Resident Alien 


301 


302 


345 


304 


181 


300 


Unknown Race 


95 


316 


127 


310 


47 


317 


Community College 


8,261 


311 


11,215 


314 


7,508 


315 


State University System 


7,443 


319 


7,145 


322 


2,736 


321 



TABLE 16 

Mathematics Mean Scaled Scores, 1992-93 
(First-Time Examinees in Public Institutions) 


EXAMINEE GROUP 


OCTOBER 


FEBRUARY 


JUNE 


Number 


Mean 


Number 


Mean 


Number 


Mean 


All 


15,678 


313 


18,347 


313 


10,244 


310 


Male 


6,668 


319 


8,019 


319 


4,292 


316 


Female 


9,010 


309 


10328 


307 


5,952 


306 


White 


10,674 


317 


12,927 


317 


7,259 


314 


Black 


2,159 


297 


1,928 


294 


950 


292 


Hispanic 


1,824 


303 


2,327 


304 


1,407 


302 


Asian/Pacific Islander 


583 


321 


632 


318 


362 


315 


American Indian/Alaskan Native 


42 


313 


61 


307 


40 


308 


Non-Resident Alien 


301 


316 


345 


317 


180 


312 


Unknown Race 


95 


311 


127 


309 


46 


305 


Community College 


8,244 


307 


11,206 


309 


7,502 


309 


State University System 


7,434 


319 


7,141 


319 


2,742 


314 
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APPENDIX A 



m CLAST Skills Tested, 1992-93 
Essay 

- Select a subject which lends itself to development. 

- Determine the purpose and the audience for writing. 

- Limit the subject to a topic which can be developed within the requirements of time, purpose, and 
audience. 

- Formulate a thesis or main idea statement which focuses the essay. 

- Develop the thesis by 

• providing adequate support which reflects the ability to distinguish between generalized and 
concrete evidence, 

• arranging the ideas and supporting details in an organizational pattern appropriate to the purpose 
and focus, 

• writing unified prose in which all supporting material is relevant to the thesis or main idea 
statement, and 

• writing coherent prose, providing effective transitional devices which clearly reflect the 
organizational pattern and the relationships of the parts. 

- Demonst ate effective word choice. 

• Avoid inappiopriate use of slang, jargon, cliches, and pretentious expressions. 

- Employ effective sentence structure. 

• Use a variety of sentence patterns. 

• Avoid overuse of passive construction. 

- Maintain a consistent point of view. 

- Use standard practice for spelling, punctuation, and capitalization. 

- Revise, edit, and proofread units of discourse to assure clarity, consistency, and conformity to the 
conventions of standard American English. 

English Language Skills 

Word Choice 

- Use words which convey the denotative and connotative meanings required by context. 

- Avoid wordiness. 



Sentence Structure 

- Place modifiers correctly. 

- Coordinate and subordinate sentence elements according to their relative importance, 

- Use parallel expressions for parallel ideas. 

- Avoid fragments, comma splices, and fused sentences. 

Grammar, Spelling, Capitalization, and Punctuation 

- Use standard verb forms. 

- Maintain agreement between subject and verb, pronoun and antecedent. 

- Use proper case forms. 

- Use adjectives and adverbs correctly. 
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English Lanffuage Skills (cont.) 



Grammar, Spelling, Capitalization, and Punctuation (cont.) 



Avoid inappropriate shifts in verb tenses. 
Make logical comparisons. 

Use standard practice for spelling, punctuation, and capitalization 



Reading 



Literal Comprehension 

- Recognize main ideas. 

- Identify supporting details. 

- Determine the meanings of words on the basis of context. 
Critical Comprehension 

- Recognize the author's purpose. 

- Identify the author's overall organizational pattern. 

- Distinguish between statement of fact and statement of opinion. 

- Detect bias. 

- Recognize the author's tone. 

- Recognize explicit and implicit relationships within sentences. 

- Recognize explicit and implicit relationships between sentences. 

- Recognize valid arguments. 

- Draw logical inferences and conclusions. 



Arithmetic 

- Add and subtract rational numbers. 

- Multiply and divide rational numbers. 

- Add and subtract rational numbers in decimal form. 

- Multiply and divide rational numbers in decimal form. 

- Calculate percent increase and percent decrease. 

- Solve the sentence "a % of b is c," where values for two of the variables are given. 

- Recognize the meaning of exponents. 

- Recognize the role of the base number in detennining place value in the base-ten numeration system. 

- Identify equivalent forms of positive rational numbers involving decimals, percents, and fractions. 

- Determine the order-relation between real numbers. 

- Identify a reasonable estimate of sum, average, or product of numbers. 

- Infer relations between numbers in general by examining particular number pairs. 

- Solve real-world problems which do not require the use of variables and which do not involve percent. 

- Solve real-world problems which do not require the use of variables and which do require the use of 
percent. 

- Solve problems that involve the structure and logic of arithmetic. 



Mathematics 
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(cont.) 



Algebra 



4 



- Add au l subtract real numbers. 

- Multiply and divide real numbers. 

- Apply the order-of-operations agreement to computations involving numbers and variables. 

- Use scientific notation in calculations involving very large or very small measurements. 

- Solve linear equations. 

- Solve linear inequalities. 

- Use given formulas to compute results when geometric measurements are not involved. 

- Find particular values of a function. 

- Factor a quadratic expression. 

- Find the roots of a quadratic equation. 

- Solve a system of two linear equations in two unknowns. 

- Use properties of operations correctly. 

- Determine whether a particular number is among the solutions of a given equation or inequality. 

- Recognize statements and conditions of proportionality and variation. 

- Identify regions of the coordinate plane which correspond to specified conditions and vice versa. 

- Use applicable properties to select equivalent equations or inequalities. 

- Solve real-world problems involving the use of variables, aside from commonly used geometric 
formulas. 

- Solve problems that involve the structure and logic of algebra. 
Geometry and Measurement 

- Round measurements to the nearest given unit of the measuring device. 

- Calculate distances, areas, and volumes. 

- Identify relationships between angle measures. 

- Classify simple plane figures by recognizing their properties. 

- Recognize similar triangles and their properties. 

- Identify appropriate units of measurement for geometric objects. 

- Infer formulas for measuring geometric figures. 

- Select applicable formulas for computing measures of geometric figures. 

- Solve real-world problems involving perimeters, areas, and volumes of geometric figures. 

- Solve real-world problems involving the Pythagorean property. 

Logical Reasoning 

- Deduce facts of set-inclusion or set non-inclusion from a diagram. 

- Identify statements equivalent to the negations of simple and compound statements. 

- Determine equivalence or nonequivalence of statements. 

- Draw logical conclusions from data. 

- Recognize that an argument may not be valid even though its conclusion is true. 

- Recognize valid reasoning patterns as illustrated by valid arguments in everyday language. 

- Select applicable rules for transforming statements without affecting their meaning. 

- Draw logical conclusions when facts warra it them. 
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Mathematics (cont.) 



Statistics, Including Probability 

- Identify information contained in bar, line, and circle graphs. 

- Determine the mean, median, and mode of a set of numbers. 

- Use the fundamental counting principle. 

- Recognize properties and interrelationships among the mean, median, and mode in a variety of 
distributions. 

- Choose the most appropriate procedure for selecting an unbiased sample from a target population. 

- Identify the probability of a specified outcome in an experiment. 

- Infer relations and make accurate predictions from studying statistical data. 

- Interpret real-world data involving frequency and cumulative frequency tables. 

- Solve real-world problems involving probabilities. 
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APPENDIX B 



B College-Level Academic Skills Project (CLASP) and 
State-Level Task Force Members, 1992-93 

CLASP Task Force Chair 

Donald Tighe, Valencia Community College 

CLASP Staff 

June Siemon, Department of Education 
Edward Croft, Department of Education 

Statewide Tes t Administrator (ST A) 
Jeaninne N. Webb, Director 

Office of Instructional Resources, University of Florida 

Standing Committee on Student Achievement 

Robert Stakenas, Chairperson, Florida State University 

Linda Adair, Gulf Coast Community College 

Laura Armesto, Barry University 

Richard Burnette, Florida Southern College 

Maura Freeberg, Keiser College 

Stephanie Gall, Bay County Schools 

Robert Judson, Jr., Pasco-Hemando Community College 

Lola Kerlin, Florida Atlantic University 

Stuart Lilie, University of Central Florida 

Jack McAfee, Indian River County Schools 

Lee Rowell, Orange County School District 

Theodore Wright, Broward Community College 

Communication Task Force Members 

Beth Novinger, Chairperson, Tallahassee Community College 

Robert Fitzgerald, South Florida Community College 

Zella Glazer, School Board of Dade County 

Betsy Griffey, Florida Community College at Jacksonville 

Iris Hart, Santa Fe Community College 

Ann Higgins, Gulf Coast Community College 

Jose Marques, Florida International University 

Alina Rodriguez, Miami Edison High School 

Patricia Stith, Florida State University 

Donald Tighe, Valencia Community College 

Donna Warford-Alley, Daytona Beach Community College 

Willa Wolcott, University of Florida 
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Mathematics Task Force Members 



Karen Walsh, Chairperson, Broward Community College 

Barbara Cotton, Florida A & M University 

Larry Eason, Pasco-Hernando Community College 

George Green, Flagler College 

Linda Gtiilfoyie, Manatee County School Board 

Mary Henderson, Okaloosa-Walton Community College 

Mary Holton, Central Florida Community College 

Patricia Kelley, University of Miami 

Gayle Kent, Florida Southern College 

Charlene Kincaid, Gulf Breeze High School 

Leonard Liplrin, University of North Florida 

Charles Miles, Hillsborough Community College 

Judy Miller, Leon High School 

Peter Rautenstrauch, University of Central Florida 
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APPENDIX C 



MItem Review Guidelines 

Overall Factors to Consider in Critiquing Items 

1. Adequate measurement of skill 

2. Fairness of items: Items should be free of racial, ethnic, sexual, regional, and cultural bias. 

3. Quality of stimulus materials (graphics and/or text that provide information required for responding 
to the item): Content should be 

a. pertinent and appropriate for the grade level, 

b. clear and understandable, 

c. believable and realistic, and 

d. familiar to students of all racial/ethnic backgrounds. 



4. Quality of answer choice: The item should contain 

a. one and only one correct answer, neither too obvious and easy nor too difficult and obscure, 
and 

b. good distractors, neither too obviously incorrect nor too 
closely related to the correct answer. 

5. Readability of items and instructions: Readability should follow guidelines set forth in the test item 
specifications. 

6. Quality of language: The language used should be 

a. clear and concise, 

b. appropriate for the grade level, 

c. appropriate for students of all racial/ethnic backgrounds, and 

d. neither too formal and stilted nor too informal and colloquial. 



7. Technical considerations: Items should be free from flaws such as 

a. too much variation in the length of response options, 

b. clues in the stem which point to the correct answer, 

c. unclear wording of the stem or directions, 

d. confusing use of negative words in the stem, :md 

e. misleading directions— e.g., asking the student to choose the correct answer when the best an- 
swer is really called for (as in choosing the best inference, or the evidence that best supports 
a given inference), or vice versa. 



Questions to Consider in Critiquing Item Construction 
1. Stimulus/stem 

a. Does the stem provide ALL THE INFORMATION necessary to answer the question? 

b. Is the desired response evident by reading the stem alone? 

c. Is the stem written in the POSITIVE (avoiding not, except , etc.)? 

d. Is the stimulus portion of the item consistent with the Stimulus A ttributes ? 
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2. Response options 

a. Is there an appropriate number of options, arranged in a LOGICAL ORDER (e.g. , numerical, 
alphabetical, chronological)? 

b. Are the options grammatically and conceptually PARALLEL? 

c. Do the options AGREE grammatically with the stem? 

d. Are the options similar and appropriate in LENGTH? 

e. Do the options embody COMMON ERRORS and are they PLAUSIBLE? 

f. Do the options AVOID "all of the above* or "none of the above"? 

3. The entire item 

a. Does the item avoid tricky words, phrases, and constructions? 

b. Is the item free of superfluous material and awkward wording? 

c. Does the item avoid unnecessary clues? 

d. Does the item focus on IMPORTANT aspects of content, not trivia? 



Considerations in Critiquing Items for Bias 

An item is considered to be biased if it contains any language or vocabulary that could benefit or hinder 
any group's performance. When reviewing an item for bias, one must consider all of the following groups: 



As you review each item, consider each of the following questions: 

1. Does the item contain any information that could seem offensive to any group? 

2. Does the item include or imply any stereotypic depiction of any group? 

3. Does the item portray any group as degraded in any way? 

4. Does the item contain any group-specific language or vocabulary (e.g., culture-related expressions, 
slang, or expressions) that may be unfamiliar to particular examinees? 



females 
males 

racial/ethnic groups 
cultural groups 
age groups 

socio-economic groups 



the visually impaired 
the hearing impaired 



persons with other handicaps 



regional groups within the U.S. 



international groups 
religious groups 
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APPENDIX D 



M Test-Retest Reliability of the CLAST 

In 1984, the Department of Education contracted with Dr. F. J. King of the Florida State University to 
study certain aspects of the reliability of the College-Level Academic Skills Test (CLAST). Dr. King prepared a 
report entitled "A Test-Retest Study of the Reliability of the College-Level Academic Skills Test." The study is 
available from the Department of Education and is summarized herein. 

Dr. King invited 360 students who had taken the CLAST in September 1984 to take the CLAST 
examination a second time. Two hundred seventy-four agreed to do so, and 220 usable scores were obtained. The 
students were retested in October 1984 with the same form of the test which had been administered in June 1984. 

The data were analyzed using several statistics. A Hambleton-Novick (1973) index was calculated to 
obtain an estimate of the decision consistency over two test forms. The Brennan-Kane (1977) index was used to 
obtain an index of decision consistency for a single test administration. The KR-20 (Stanley, 1971) index was also 
calculated because it is a reliability coefficient widely used with norm-referenced tests. 

The Hambleton-Novick index calculated with the 1984 passing criteria resulted in the following: 

Computation 0.97 

Reading 0.86 

Writing 0.96 

Essay 0.86 

The Brennan-Kane indices for the subtests were as follows: 

Computation 0.96 

Reading 0.96 

Writing 0.92 

Essay not applicable 

The KR-20 internal consistency coefficients for the subtests resulted in values of: 

Computation 0.83 

Reading 0.87 

Writing 0.74 

Essay not applicable 

The reliability coefficients varied depending on which test administration was being analyzed, the relative 
difficulty of the tests, and the psychometric characteristics of the tests themselves. Further, it must be recognized 
that the reported reliability coefficients will vary for subpopulations (e.g., Hispanic) and will vary depending on 
the placement of the passing criterion. 
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